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Abstract — In this paper, pilot-assisted transmission over time- 
selective flat fading channels is studied. It is assumed that 
noncausal and causal Wiener filters are employed at the receiver 
to perform channel estimation with the aid of training symbols 
sent periodically by the transmitter. For both filters, the variances 
of estimate errors are obtained from the Doppler power spectrum 
of the channel. Subsequently, achievable rate expressions are 
provided. The training period, and data and training power 
allocations are jointly optimized by maximizing the achievable 
rate expressions. Numerical results are obtained by modeling 
the fading as a Gauss-Markov process. The achievable rates of 
causal and noncausal filtering approaches are compared. For 
the particular ranges of parameters considered in the paper, the 
performance loss incurred by using a causal filter as opposed to 
a noncausal filter is shown to be small. The impact of aliasing 
that occurs in the undersampled version of the channel Doppler 
spectrum due to fast fading is analyzed. Finally, energy-per-bit 
requirements are investigated in the presence of noncausal and 
causal Wiener filters. 

I. Introduction 

In wireless communications, channel conditions vary over 
time due to mobility and changing environment. If the channel 
conditions are not known a priori, practical wireless systems 
generally employ training sequences to perform channel esti- 
mation, receiver adaptation and optimal decoding [5]. Cavers 
in [1] and [2] conducted one of the early studies in this 
area and provided an analytical approach to the design of 
pilot-assisted transmissions. Recently, there has been much 
interest in the optimization of training parameters using an 
information-theoretic approach. Hassibi and Hochwald [4] 
considered the multiple-antenna Rayleigh block fading channel 
and optimized the power and duration of training signals by 
maximizing a capacity lower bound. Adirredy et al. [3] investi- 
gated the optimal placement of pilot symbols and showed that 
the periodical placement maximizes the data rates. In general, 
the amount, placement, and fraction of pilot symbols in the 
data stream have considerable impact on the achievable data 
rates. 

Considering adaptive coding of data symbols without feed- 
back to the transmitter, Abou-Faycal et al. [10] studied the data 
rates achieved with pilot-symbol-assisted modulation (PSAM) 
over Gauss-Markov channels. The authors in [11] also studied 
the PSAM over Gauss-Markov channels and analyzed the 
power allocation of data symbols when the pilot symbol has 
fixed power They showed that the power has a decreasing 
character with respect to the distance to the pilot symbol. In 
similar settings, [12] analyzed the training power when the 



data power is distributed uniformly. More recently, we in [13] 
jointly optimized the pilot symbol period and power allocation 
among pilot and data symbols by maximizing the achievable 
rates in Gauss-Markov fading channels. Ohno and Giannakis 
[7] considered general slowly-varying fading processes. Em- 
ploying a noncausal Wiener filter for channel estimation at the 
receiver, they obtained a capacity lower bound and optimized 
the spacing of training symbols and training power. Baltersee 
et al. in [8] and [9] have also considered using a noncausal 
Wiener filter to obtain a channel estimate, and they optimized 
the training parameters by maximizing achievable rates in 
single and multiple antenna channels. 

In this paper, we study training-based transmission and 
reception schemes over a-priori unknown, time-selective 
Rayleigh fading channels. Since causal operation is crucial 
in real-time, delay-constrained applications, we consider the 
use of causal, as well as noncausal, Wiener filters for channel 
estimation. We optimize the training parameters by maximiz- 
ing a capacity lower bound. Although the treatment is general 
initially, we concentrate on the Gauss-Markov channel model 
for numerical analysis. As another contribution, we analyze 
fast fading channels and the impact upon the performance of 
aliasing due to under-sampling of the channel. 

II. Channel Model 



The time-selective Rayleigh channel is modeled as 

Vk^hkXk+7ik A: = 1,2,3,... 



(1) 



where is the complex channel output, Xk is the complex 
channel input, {uk} is assumed to be a sequence of inde- 
pendent and identically distributed (i.i.d.) zero-mean Gaussian 
random variables with variance af^, and {hk} is the sequence 
of fading coefficients, {hk} is assumed to be a zero-mean sta- 
tionary Gaussian random process with power spectral density 
Sh{c-''^). It is further assumed that is independent of hk 
and Uk- While both the transmitter and the receiver know the 
channel statistics, neither has prior knowledge of instantaneous 
realizations of the fading coefficients. Note that the discrete- 
time model is obtained by sampling the received signal every 
Ts seconds. 

III. Pilot Symbol-Assisted Transmission and 
Reception 

We consider pilot-assisted transmission where periodically 
inserted pilot symbols, known by both the sender and the 
receiver, are used to estimate the fading coefficients of the 



channel using a Wiener filter. We assume the simple scenario 
where a single pilot symbol is transmitted every M symbols 
while M — 1 data symbols are transmitted in between the pilot 
symbols. We consider the following average power constraint 
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E[\xk\^] <P ^ = 0,1,2, 



(2) 



k=lM 



on the input. Therefore, the total average power allocated to 
the pilot and data transmission over a duration of M symbols 
is limited by MP. Communication takes place in two phases. 
In the training phase, the transmitter sends pilot symbols and 
the receiver estimates the channel coefficients. In this phase, 
the channel output is given by 
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(5) 

Also shown in [7], it can easily be seen from [6] that the 
channel MMSE for the noncausal Wiener filter at time Ml + m 
is given by 

1 r PtlSh^raien? 



where Pt again denotes the power allocated to one pilot 
symbol. On the other hand, from [6], we can also easily find 
that the channel MMSE at time All + m for the causal Wiener 
filter is given by 
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(7) 
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The operators {}+ and {}^ yield the causal and the anti-causal 
part of the function to which they are applied, respectively. 
Note that, using the orthogonality principle, we have 



(9) 



where a~ is the variance of the channel estimate at time 
Ml+m. Similarly as in [13], treating the error in (|4]i as another 
source of additive noise and assuming that 



is zero-mean Gaussian noise with variance 



(10) 



(11) 



where Pt is the power allocated to the pilot symbol. In the 
data transmission phase, data symbols are transmitted. In this 
phase, the input-output relationship can be written as 

Vk = hkXk + hkXk + rik IM < k < {I + 1)M - 1 (4) 

where hk and hk are the estimated channel coefficient and the 
error in the estimate at sample time k, respectively. Note that 
hk and hk for IM < k < {I + 1)M — 1 are uncorrected 
zero-mean circularly symmetric complex Gaussian random 
variables with variances cr- and a~ , respectively. 

IV. Achievable Rates 

For the estimation of the fading coefficients, we assume 
that a Wiener filter, which is the optimum linear estimator in 
the mean-square sense, is employed at the receiver. Note that 
since pilot symbols are sent with a period of M, the channel is 
sampled every MTs seconds. Therefore we have to consider 
the under-sampled version of the channel's Doppler spectrum 
which is given by 



we obtain the following lower bound on the channel capacity: 



(12) 



where ^ is a zero-mean, unit-variance, circularly symmetric 
complex Gaussian random variable and P,„ ~ E \\xMi+m\'^\ 
denotes the power of the m*'* data symbol after the pilot 




symbol. Note that the error variance a~ 



depends in 



general on m and hence the location of the data symbol with 
respect to the pilot symbol. However, if the fading slowly 
varies and the channel is sampled sufficiently fast, we can 
satisfy 2-Kfo < tt/M where fu is the maximum Doppler 
frequency of the channel. In this case, M < jj^. We can 
see from the Nyquist's Theorem that there is no aliasing in 
the under-sampled version of the channel's Doppler spectrum, 
and hence |5,.,™(eJ'"')| = \ShAe'")\ = \Sh{e^'"'/^'^)\/M, for 
TO G [1, 7\f — 1] and — tt < w < tt. Therefore, (|6]l reduces to 
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and also ([7|l can be expressed as 
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where 



where L* (e-'™ ) is obtained from the canonical factorization of 
the channel output's sampled power spectral density at to = 0, 
which is given by 
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Therefore, under this assumption, the error variances become 
independent of to. Since the estimate quality is the same for 
each data symbol regardless of its position with respect to 



2 



the pilot symbol, uniform power allocation among the data 
symbols is optimal and we have 

MP - Pt 



M - 1 

Then, we can rewrite (fT2] ) as 
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(17) 



V. Optimizing Training Parameters in 
Gauss-Markov Channels 

In this section, we assume that the fading process is modeled 
as a first-order Gauss-Markov process, whose dynamics is 
described by 




ahk-i+Zk 0<a<l /c = l,2,3, 



(18) 



where {zk] are i.i.d. circular complex Gaussian variables with 
zero mean and variance equal to (l-a^)cr^. The power spectral 
density of the Gauss-Markov process with variance cr^ is given 

s^i^n - ,,Vf"% V (19) 

1 + — 2acos[w) 

Note that Sh{e^^) in ( fT9l l is not bandlimited and hence the 
condition 271 Jd < tt/A/ can only be satisfied when M = 1 
which is not a viable strategy. However if the fading is slowly- 
varying and hence the value of a is close to 1, the Doppler 
spectrum Sh{e^^) decreases sharply for large frequencies and 
most of the energy is accumulated at low Doppler frequencies. 
We can easily find that the frequency ranges [— 7r/49, 7r/49], 
[— 7r/9,7r/9] and [— 7r/4,7r/4] contain more than 90 % of the 
power when a = 0.99, 0.95, and 0.90, respectively. Hence, 
if M < 49, 9, and 4, respectively, in these cases, the impact 
of aliasing will be negligible. Otherwise, ignoring the effect 
of aliasing will decrease the error variance and hence the 
achievable rates under this assumption will be higher than 
those obtained when aliasing is considered. 

In the Gauss-Markov model, the error variance for the 
noncausal Wiener filter can easily be obtained from (|6]l. In 
order to obtain the error variance for the causal filter in 
the absence of aliasing, we have to perform the canonical 
factorization. We begin with rewriting ([8]) as 
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where 



1 - ue-^"" 
1 - ae-^^ ' 



From (l20b . we can deduce that 

(21) 

where 



From (l2Tl i. we can write 

c+ Jc? - Aa^cjf^ 
= ^ 



and w = ^ (22) 



where < it < 1 and r/ > 0. After the canonical 
factorization, we can write 
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u{\ — ua) 
The anti-causal part can be written as 

Shje'""'^"^) \ _ (1 - e^""'^^ 



After making a change of variables, we have 
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F*(eJ'") J _ (1 - ua) (1 - ■ue^"') 
VI. Numerical Results 
A. Optimal Parameters and Effects of Aliasing 

In this section, we present our numerical results. Initially, we 
consider noncausal Wiener filtering and jointly optimize the 
training period, and data and pilot symbol power allocation. 
Moreover, we study the effects of aliasing in the under- 
sampled channel Doppler spectrum. In Figure [T] we plot the 
achievable rates as a function of the training period when 
a = 0.99, i.e., when the channel is changing very slowly, for 
SNR values of 0, 5, 10 and 20 dB. In this figure, plotted curves 
are obtained with optimal pilot and data power allocation. 
The dotted lines give the data rates obtained when aliasing is 
taken into account. Solid lines show the rates when aliasing is 
ignored. As seen in Fig. [1] when SNR is small, the difference 
between the dotted and solid lines is negligible. As SNR 
increases, the difference between the lines is also increasing. 
From this, we can conceive that the effect of aliasing is also 
increasing with increasing power When a = 0.99 and aliasing 
is taken into account, the optimal training periods are 16, 15, 
12 and 7 for SNR values of 0, 5, 10 and 20 dB, respectively. 
On the other hand, when aliasing is ignored, we have optimal 
values as 25, 21, 16 and 8. Hence, the optimal training period 
decreases as SNR increases and aliasing is considered. 

In Figure |2] we plot the achievable rates when a — 0.90. 
Comparing Figs. [T] and |2] we observe that aliasing has a 
more significant impact as a decreases. This is expected since 
aliasing increases in a faster changing channel and hence 
ignoring aliasing provides a looser upper bound. When a = 
0.90 and aliasing is taken into account, the optimal training 
periods are 7, 6, 5 and 4 for SNR values of 0, 5, 10 and 20 
dB, respectively. When aliasing is ignored, the optimal values 
are 5, 5, 4 and 4, respectively. As before, the optimal period 
is decreasing and the effect of aliasing is increasing with the 
increasing SNR. 

Figure |3] and Figure |4] are the bar graphs providing the 
optimal training and data power allocation for a — 0.99 and 
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0.90, respectively, when the training period is at its optimal 
value. In the graphs, the first and the last bars give the power 
of the pilot symbols and the ones in between represent the 
data symbol power levels. These bar graphs are obtained 
when the effect of aliasing on the channel estimation is taken 
into account. We can immediately observe from both graphs 
that the data symbols farther away from the pilot symbols 
are allocated less power because the error in the estimation 
increases with the distance to the pilot symbols. In Fig. |3] the 
decrease in the allocated power is small since the channel is 
very slowly varying and estimate error is almost independent 
of TO. On the other hand, the decrease is more obvious when 
the channel changes faster as evidenced in Fig.|4] Furthermore, 
comparing Figs. [3]and|4l we see that when the training period 
value is high, more power is allocated to the pilot symbol, 
enabling the system to track the channel more accurately. 

B. Causal Filter Performance in the Absence of Aliasing 

In this section, we study the performance when a causal 
Wiener filter is employed at the receiver Since it is rather 
difficult to obtain the canonical factorization of arbitrary 
spectrums, we only consider cases in which the channel is 
slowly varying and the aliasing effect can be ignored. In Figure 
|5] we plot the achievable rates as a function of the training 
period for a = 0.99 when noncausal and causal Wiener 
filters are used. We compare the results when SNR=: 0, 5, 10 
and 20 dB. The dotted lines provide the rates for the case 
of the causal filter and the solid lines show the results for 
the case of the noncausal filter. We observe that the optimal 
training periods are 44, 29, 19 and 9 for the causal filter when 
SNR= 0, 5, 10 and 20, respectively. For the noncausal filter, 
the optimal periods are 25, 21, 16 and 8 for the same SNR 
values. We observe from the plots that the performance of 
causal and noncausal filters are very close. In Figure |6] we 
plot the achievable rates as a function of SNR at optimal 
periods obtained by using causal and noncausal filters. Again 
the performances are very similar Moreover, after 45 dB, the 
rates are the same for both filters. Therefore, for the ranges of 
parameters considered in these figures, causal filter should be 
preferred over the noncausal one. 

In systems where energy is at a premium, the energy 
required to send one bit of information is a metric that can 
be adopted to measure the efficiency of the system. The 
least amount of normalized bit energy required for reliable 
communications is given by ^ = cfsN^R) where C{SNR) 
is the channel capacity in bits/symbol. In our setting, we use 
the achievable rates and analyze the required bit energy levels. 
In Figure |2l we plot the bit energy levels. The dashed and 
solid lines show the results for causal and noncausal filters. 
Note that the minimum bit energies are achieved at SNR = 
-4dB and -3dB for noncausal and causal filters, respectively. 
Operating below these SNR levels should be avoided as it 
only increases the required energy per bit. Figure [8] shows 
the optimal training period values as a function of SNR for 
both filters. Interestingly, the optimal period is increasing as 
SNR decreases for the causal filter while it first increases 
and then decreases when the noncausal filter is used. Since 
both past and future pilots are used when a noncausal filter 




Fig. 1. Achievable rates when a = 0.99 for SNR=0, 5,10, and 20 dB. The 
dotted Hnes provide rates when aliasing is taken into account, and the solid 
lines give the rates when ahasing is ignored. 




Fig. 2. Achievable rates when a = 0.90 for SNR=0, 5,10, and 20 dB. The 
dotted lines provide rates when aliasing is taken into account, and the solid 
lines give the rates when ahasing is ignored. 



is employed, having large training periods will diminish the 
benefits of future pilots especially for the data symbols in the 
middle. Therefore, this option is avoided in this case. On the 
other hand, having a larger period in the causal filter case 
enables the system to put more power to the pilot by not 
using data symbol slots farther away from the pilot and hence 
to obtain more accurate channel estimates. In both filters, as 
SNR increases the optimal period value stays constant at 5. 

VII. Conclusion 

We have studied pilot-assisted communications when causal 
and noncausal Wiener filters are employed at the receiver 
for channel estimation. We have obtained achievable rate 
expressions by finding the error variances in both cases. 
Subsequently, we have jointly optimized the training period 
and power, and data power levels. We have analyzed the 
effects of aliasing on the data rates in Gauss-Markov Rayleigh 
fading channels when noncausal filters are used. We have 
provided numerical results showing the optimal parameters. 
We have compared the performances of causal and noncausal 
Wiener filters at different SNR values. We have also studied 
the energy-efficiency of pilot-assisted modulation with both 
filters. 
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Fig. 3. The optimal power distribution among tlie pilot and data symbols 
when a = 0.99 and SNR=OdB. The optimal period is 16. 
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Fig. 4. The optimal power distribution among the pilot and data symbols 
when a = 0.90 and SNR=OdB. The optimal period is 5. 



Fig. 8. Optimal period vs. SNR(dB) for causal and noncausal filters when 
a = 0.99. 




Fig. 5. Achievable rates vs. training period when noncausal and causal filters 
are employed at the receiver, a = 0.99 and SNR=0, 5,10, and 20 dB. The 
solid lines give the rates when a noncausal filter is used and the dotted lines 
show the rates when a causal filter is used. 
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Fig. 6. Achievable rates vs. SNR when noncausal and causal filters ai'e 
employed at the receiver a = 0.99. The dashed line gives the rate when a 
noncausal filter is used and the solid line shows the rate when a causal filter 
is used. 
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